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Synthetic rhgl gene 

1 

AATGGGAGGAGTGGGAAAGACAGTGGCTATGGAGCTTGTTCCGGAGGTTGGGTTGGAAT 

CAAGTGTGCTCAGGGACAGGTTATTGTGATCCAGCTTCCTTGGAAGGGTTTGAGGGGTC 

GAATCACCGACAAAATTGGCCAACTTCAAGGCCTCAGGAAGCTTAGTCTTCATGATAAC 

CAAATTGGTGGTTCAATCCCTTCAACTTTGGGACTTCTTCCCAACCTTAGAGGGGTTCA 

GTTATTCAACAATAGGCTTACAGGTTCCATACCTCTTTCTTTAGGTTTCTGCCTTTGCT 

TCAAGTCTCTTGACCTCAGCAACAACTTGCTCACAGGAGCAATCCCTTATAGTCTTGCT 

AATTCCACTAAGCTTTATTGGCTTAACTTGAGTTTCAACTCCTTCTCTGGTCCTTTACC 

AGCTAGCCTAACTCACTCATTTTCTCTCACTTTTCTTTCTCTTCAAAATAACAATCTTT 

CTGGCTCCCTTCCTAACTCTTGGGGTGGGAATTCCAAGAATGGCTTCTTTAGGCTTCAA 

AATTTGATCCTAGATCATAACTTTTTCACTGGTGACGTTCCTGCTTCTTTGGGTAGCTT 

AAGAGAGCTCAATGAGATTTCCCTTAGTCATAATAAGTTTAGTGGAGCTATACCAAATG 

AAATAGGAACCCTTTCTAGGCTTAAGACACTTGACATTTCTAATAATGCCTTGAATGGG 

AACTTGCCTGCTACCCTCTCTAATTTATCCTCACTTACACTGCTGAATGCAGAGAACAA 

CCTCCTTGACAATCAAATCCCTCAAAGTTTAGGTAGATTGCGTAATCTTTCTGTTCTGA 

TTTTGAGTAGAAACCAATTTAGTGGACATATTCCTTCAAGCATTGCAAACATTTCCTCG 

CTTAGGCAGCTTGATTTGTC ACTGAATA ATTTCAGTGGAGAAATTCCAGTCTCCTTTGA 

CAGTCAGCGCAGTCTAAATCTCTTCAATGTTTCCTACAATAGCCTCTCAGGTTCTGTCC 

CCCCTCTGCTTGCCAAGAAATTTAACTCAAGCTCATTTGTGGGAAATATTCAACTATGT 

GGGTACAGCCCTTCAACCCCATGTCTTTCCCAAGCTCCATCACAAGGAGTCATTGCCCC 

ACCTCCTGAAGTGTCAAAACATCACCATCATAGGAAGCTAAGCACCAAAGACATAATTC 

TCATAGTAGCAGGAGTTCTCCTCGTAGTCCTGATTATACTTTGTTGTGTCCTGCTTTTC 

TGCCTGATCAGAAAGAGATCAACATCTAGGCCGGGAACGGCCAAGCCACCCGAGGGTAG 

AGCGGCCACTATGAGGACAGAAAAAGGAGTCCCTCCAGTTGCTGGTGGTGATGTTGAAG 

CAGGTGGGGAGGCTGGAGGGAAACTAGTCCATTTTGATGGACCAATGGCTTTTACAGCT 

GATGATCTCTTGTGTGCAACAGCTGAGATCATGGGAAAGAGCACCTATGGAACTGTTTA 

TAAGGCTATTTTGGAGGATGGAAGTCAAGTTGCAGTAAAGAG ATTGAGGGAAAAGAT CA 

CTAAAGGTCATAGAGAATTTGAATCAGAAGTCAGTGTTCTAGGAAAAATTAGACACCCC 

AATGTTTTGGCTCTGAGGGCCTATTACTTGGGACCCAAAGGGGAAAAGCTTCTGGGTTT 

TGATACATGTCTAAAGGAAGTCTTGCTTCTTTCCTACATGGAAGGTTCGTGTGCTGGTT 

CTTTCATTAAAGTGTTGTGTGTGCTGGTCTTTAATTATAATTTGGAGTTTTACCTTAGT 

AATCTGTATAATTCTAATCGGAGAACAGTACAAACAAAAACACCTAAGGAACAACACCT 

TANCTTTAATATACCATATCAATAAAGTGAAATATTTTCTTGGTCATCTTGATGCAGGG 

GGAACTGAACATTCATTATTGGCCACA AGATTAAAATA GCCCAAGCCTTGGCCCGGGCT 

TGTTTGCCTTCATTCCCAGGAGAACATCATACATGGGACCTCNCATCCAGCAATGTGTG 

GCTTGATGAAAAACAAATGCTAAAATTCAGATTTTGGTCTTTTTCGGGTTGATGTCAAC 

TGCTGCTAATTCCAACGTGATAGCTACAGCTGGAGCATTGGATACCGGGCACCTGAGCT 

CTCAAAGCTCAAGAAAGCAAACACTAAAACTGATATCTACAGTCTTGGTGTTATCTTGT 

TAGAACTCCTAACGAGGAAATCACCTGGGGTGTCTATGAATGGACTAGATTTGCCTCAG 

TGGGTTGCCTCAGTTGTCAAAGAGGAGTGGACAAATGAGGTTTTTGATGCAGACTTGAT 

GAGAGATGCATCCACAGTTGGCGACGAGTTGCTAAACACGTTGAAGCTCGCTTTGCACT 

GTGTTGATCCTTCTCCATCAGCACGACCAGAAGTTCATCAAGTTCTCCAGCAGCTGAAG 

AGATTAGACCAGAGAGATCAGTCACAGCCAGTCCCGGGGACGATATCGTATAGCACAAA 

TTTTGCATTGATTTTTTTGTGCCAAATGTAGTAGGCCTACTATATATATGTTCTATGAT 
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TCTTTCATTCTTATATTATTTTTGCCTGTTTGAATGCTTGAATTTGTACATACTCATAC 
TACAATAAGGTGTAGTTCTGGTTAATTTTACCTCTACCTCAAAGCTGGGGTGTAATTCT 
GTTTCCTCCAAGGCACATAATAGTTGAAAATAGTTCTCAGGAGCATTCATTGTTTATTC 
TGCAAGATTCTCTTTCACGGCTGCTATCTTCTATGCATGCCCTGCCCATAAATGCATTA 
TGAAGAATTGTAACGGCTGTGTTTTTGGACTTCTTCAAAAAGTTTATGTTATTGCCAGG 
TGTATATATCAACATGTTTTAAAGATTTTCAAACAATCAGGTTTTAGATGTGGGTTTGC 
ATGCATGAGATTGGACTAGTGCGCTTGATGTAGTATAAAATATAAATTGTCCAATCAAG 
CACCCTCTACATGTCCAAATAATGGGCCTTATGAAACTTAATTTTTTAATTAC^AACTA 
CAGTAATCTTTTTGAATAAAGATTTACAAATTACAACNGACATGTGAAGCNGCATCTTT 
NATTGNCAATCTTTCAAGTTACTCTATTATTTTCTGCN 
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Rhgl Peptide 

NGRSGKDSGYGACSGGWVGIKCAQGQVIVIQLPWKGLRGRIT 

DKIGQLQGLRKLSLHDNQIGGSIPSTLGLLPNLRGVQLFNNRLG 

SIP 

LSLGFCPLLQSLDLSNNLLTGAIP 

YSLANSTKLYWLNLSFNSFSGPLP 

ASLTHSFSLTFLSLQNNNLSGSLPNSWGG 

NS KNGF FRLQNL I LDHNFFTGDVP 

ASLGSLRELNEISLSHNKFSGAIP 

NEIGTLSRLKTLDI SNNALNGNLP 

ATL SNLS SLTLLNAENNLLDNQI P 

QSLGRLRNLSVLILSRNQFSGHIP 

SSIANISSLRQLDLSLNNFSGEIP 

VSFDSQRSLNLSNVSYNSLSGSVP 

PLLAKKFNS S S FVGNI QLCGYSP 
STPCLSQ 

AP S QGV I AP P P E VS KHHHHR 

KLSTKDIILIVAGVLLWLIILCCVLLFCLIRKRS 

TSKAGNGQATEGRAATMRTEKGVPPVAGGDVEAGGEAGGKLVHF 
DGPMAFTADDLLCAT AEIMGKSTYGTVYKAILEDGSQVAVKRLR 
EKI TKGHREFE SEVS VLGKI RHPNGLALRAYYLGPKGEKLLVFD 
YMS KGGL LLFYMEGS CAGS F I KVLCVLVFNYNLEF YLSNLYNSN 
RRTVQTKTPKEQHLXFNI PYQ 

- SEIFSWSS - CRGN-TFI IGHKMKIXQDLAVACSPSFPETSYMD 
LXSSNVCX-NXMLKLQFWSFSVDVNCC-FQRDSYSWSIGIPGT- 
ALKAQE S KH -N - YLQS WCYLVRTPNEE I TWGVYEWTRFAS VGCL 
SCQRGVDK- GF - CRLDERC I HS WRRVAKHVEARFALC -SFSIS 
TTRSSSSSPAAGRD-TREISHSQSHLPGRPLEPYSESY 
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Score E 



Sequences producing significant alignments: 

pir tT46070 hypothetical protein T18N14.120 - Arabidopsis thaliana 
pir:T47727 hypothetical protein F18O21.60 - Arabidopsis thaliana 
pir:T04587 hypothetical protein F23E13.70 - Arabidopsis thaliana 
pir; T49038 hypothetical protein T5P19.20 - Arabidopsis thaliana 
pir: T48210 hypothetical protein T20M5.160 - Arabidopsis thaliana 
pir: T050S0 protein kinase homo log M3E9.30 - Arabidopsis thaliana 
pir:Tl853f receptor-like protein kinase - Ipomoea nil (Japanese... 
pir:T48489 receptor-like protein kinase - Arabidopsis thaliana 
pir: T10515 disease resistance protein Cf-2.2 - currant tomato 
pir: T10504 disease resistance protein Cf-2.1 - currant tomato 
pir: T30553 disease resistance protein Hcr2-5D - tomato 
pir: S27756 receptor-like protein kinase 5 (EC 2.7.1.-) precurso. . . 

pir: T48499 receptor-like protein kinase -like protein - Arabidop 

pir; T46033 receptor protein kinase-like protein - Arabidopsis t... 
pir: T05335 hypothetical protein F1C12.190 - Arabidopsis thaliana 
pir: T10636 hypothetical protein T13K14.100 - Arabidopsis thaliana 
pir: TQ5898 hypothetical protein F6H11.170 - Arabidopsis thaliana 
pir: T45717 receptor-kinase like protein - Arabidopsis thaliana 
pir: T0S322 hypothetical protein F18F4.240 - Arabidopsis thaliana 

pir: T10659 probable serine/ threonine- specif ic protein kinase (E 

pir; T03784 probable receptor protein kinase - rice 
pir ; T50851 receptor protein kinase homolog [imported] - soybean 
pir; T45647 receptor protein kinase-like protein - Arabidopsis t... 
pir: T457l8 receptor- kinase like protein - Arabidopsis thaliana 
pir: T50850 receptor protein kinase homolog [imported] - soybean 
pir: T45645 receptor kinase-like protein - Arabidopsis thaliana 
pir; T09356 brassinosteroid- insensitive protein BRI1 - Arabidops... 
pir; T00712 protein kinase homolog F22013.7 - Arabidopsis thaliana 
pir: A57676 protein kinase Xa21 (EC 2.7.1.-), receptor type prec... 

pir: S39476 kinase-like transmembrane protein TMKL1 precursor - 

pir; T02154 protein kinase homolog T1F15.2 - Arabidopsis thaliana 
pir; Tl0725 protein kinase Xa21 (EC 2.7.1.-) Al # receptor type 
pir: T0S897 protein kinase homolog F6H11.160 - Arabidopsis thaliana 
pir: T04313 protein kinase Xa21 (EC 2.7.1.-), receptor type - rice 
pir: T10727 protein kinase Xa21 (EC 2.7.1.-) D, receptor type - ... 
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>pir ; T46070 hypothetical protein T18N14.120 - Arabidopsis thaliana 
Length * 83 6 

Score m 632 bits (1613), Expect = e-180 

Identities - 329/550 (59%), Positives = 400/550 (71%), Gaps = 2/550 (0%) 
Frame = +1 

Query: 7 RSGKDSGYGACSGGWVGIKCAQGQVIVIQLPWKGLRGRITDKIGQLQGLRKLSLHDNQIG 186 

+S +S GW GIKC +GQV+ IQLPWKGL G I++KIGQL LRKLSLH+N I 

Sbjct: 72 KSWNNSASSQVCSGWAGIKCLRGQWAIQLPWKGLGGTISEKIGQLGSLRKLSLHNNVIA 131 

Query: 187 GSIPSTLGLLPNLRGVQLFNNRLTGSIPLSLGFCLCFKSLDIiSNNLLTGAIPYSLANSTK 366 

GS+P +LG L +LRGV LFNNRL+GSIP+SLG C ++LDLS+N LTGAIP SL ST+ 
Sbjct : 132 GSVPRSLGYLKSLRGVYLFNNRLSGSIPVSLGNCPLLQNLDLSSNQLTGAIPPSLTESTR 191 

Query: 3 67 LYWLNLSFNSFSGPLPASLTHSFSLTFLSLQNNNLSGSLPNSWGGNSKNGFFRLQNLILD 54 6 

LY LNLSFNS SGPLP S+ S++LTFL LQ+N1TLSGS+P+ + NG L+ L LD 

Sbjct: 192 LYRLNLSFNSLSGPLPVSVARSYTLTFLDLQHNNLSGSIPDFF VNGSHPLKTLNLD 247 

Query: 547 HNFFTGDVPASLGSLRELNEISLSHNKFSGAIPNElGTIiSRLKTLDISNNALNGNLPATL 726 

HN F+G VP SL L E+S+SHN+ SG+IP E G L L++LD S N++NG +P + 

Sbjct: 248 HNRFSGAVPVSLCKHSLLEEVSISHNQLSGSIPRECGGLPHLQSLDFSYNSINGTIPDSF 307 

Query: 727 SNLSSLTLLNAENNLLDNQIPQSLGRLRNLSVLILSRNQFSGHIPSSIANISSLRQLDLS 906 

SNLSSL LN E+N L IP ++ RL NL+ L L RN+ +G IP +1 NIS +++LDLS 
Sbjct: 308 SNLS S LVSLNLESNHLKGPI PD AIDRLHNLTELNLKRNKINGP I PETIGNISGI KKLDLS 3 67 

Query: 907 LNNFSGEIPVSFDSQRSLNLFNVSYNSLSGSVPPLL71KKFNSSSFVGNIQLCGYSPSTPC 1086 

NNF+G IP+S L+ FNVSYN+LSG VPP4-L+KKFNSSSF+GNIQLCGYS S PC 

Sbjct: 368 ENNFTGPIPLSLVHLAKLSSFNVSYNTLSGPVPPVLSKKFNSSSFLGNIQLCGYSSSNPC 427 

Query: 1087 LSQAPSQGVIAPP- -PEVSKHHHHRKLSTKDIILIVAGVLLWLIILCCVLLFCLIRICRS 1260 

+ + P + + HHHRKLS KD+ILI G LL +L++LCC+LL CLI+KR+ 

Sbjct: 428 PAPDHHHPLTLSPTSSQEPRKHHHRKLSVKDVILIAIGALLAILLLLCCILLCCLIKKRA 487 

Query: 1261 TSRPGTAKPPEGRAATMRTEKGVPPVAGGDVEAGGEAGGKLVHFDGPMAFTADDLLCATA 1440 

K +G+ T +EK V G AGGE GGKLVHFDGP FTADDLLCATA 

Sbjct: 488 ALKQKDGKDKT- - SEKTVSAGVAGTASAGGEMGGKLVHFDGPFVFTADDLLCATA 540 

Query: 1441 EIMGKSTYGTVyKAILEDGSQVAVKRLREKITKGHREFESEVSVLGKIRHPNVLALRAYY 1620 

EIMGKSTYGT YKA LEDG+ +VAVKRLREK TKG +EFE EV+ LGKIRH N+LALRAYY 
Sbjct: 541 EIMGKSTYGTAYKATLEDGNEVAVKRLREKTTKGVKEFEGEVTALGKIRHQNLIx^RA^ 600 

Query: 1621 LGPKGEKLLGFD 1656 

LGPKGEKLL FD 
Sbjct: 601 LGPKGEKLLVFD 612 
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Score = 185 bits (464) , Expect « le-45 

Identities = 93/161 (57%) , Positives » 122/161 (75%) , Gaps * 3/161 (1%) 
Frame = +2 



Query: 1943 GLVCLHSQENT IHGTSHPAMCGLMKNKC*NS DFGLFRVDVNCC * FQRDS YSWS IGYR 2113 

GL LHS EN+IH + ++ ++ N+ D+GL R+ + + ++GYR 

Sbjct: 647 GLAHLHSNENMIH - - ENLTASNILLDEQTNAHIADYGLSRLMTAAAATNVI ATAGTLGYR 704 

Query: 2114 APELSKLKKANTKTDIYSLGVILLELLTRKSPGVSMNGLDLPQWASVVKEEWTNEVFDA 2293 

APE SK+K A+ KTD+YSLG+I+LEUiT KSPG NG+DLPQWVAS + VKEEWTNEVFD 
Sbjct: 705 APEFS KI KNAS AKTDVYSLGI I ILELLTGKS PGEPTNGMDLPQWVAS I VKEEWTNEVFDL 764 

Query: 2294 D LMRDAS TVGDE LLNTLKL ALHCVD P S P S ARPE VHQVLQQLKRL 2425 

+LMR+ +VGDELLNTLKLALHCVDPSP+ARPE +QV++QL+ + 
Sbjct: 765 ELMRETQSVGDELLNTLKLALHCVDPSPAARPEANQVVEQLEEI 808 
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